[57]



ABSTRACT 



A spelling corrector develops a phonetic correction
candidate list and a typographic correction candidate
list. These lists are merged using similarity values that
measure the degree of similarity between the query
(input) word and each word on the candidate lists. The
similarity values include one based on phonetic comparison,
one based on typographic (ASCII, for example)
comparison, one based on a vowel and consonant typographic
pattern comparison and one based on a consonant
phonetic comparison.

11 Claims, 3 Drawing Sheets 
ELECTRONIC SPELLING MACHINE HAVING
ORDERED CANDIDATE WORDS

REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of patent application
Ser. No. 199,060 entitled Electronic Spelling Machine filed on
May 25, 1988, now U.S. Pat. No. 5,113,340 issued May 12, 1992.

BACKGROUND OF THE INVENTION

This invention relates to an improved electronic spelling machine
and in particular to one having an enhanced ability to compare an
input query word against a set of words stored in memory.

The utility of hand held electronic spelling machines is essentially
in their ability to provide an appropriate list of suggested matches
against a query word and to do so in a minimum of time. Utility is
enhanced if the words presented are ranked in an order that presents
the most likely word or spelling being sought.

One way of providing a fairly extensive list without overwhelming
the user is to provide a list of candidates [...] likely match is at
the top of the list and the likelihood of the match being the one
sought decreases as the user scrolls down the list. In this fashion,
the user typically finds the word being sought immediately and only
infrequently has to scroll further down the list of candidate words
in order to find the appropriate word.

Closely related to this issue of providing a meaningful ranked list
of words is the question of providing both typographic corrections
and phonetic corrections. That is, a query word is put through a
first process which provides a list of candidate words based on the
assumption that there is a typographic error in the query word. The
query word is also put through a different type of matching process
in which the assumption is made that there are phonetic errors in
the input word. For example, spelling "bell" as either "ball" or as
"bel" represents a typographic kind of error. By contrast, spelling
"foreign" as "forun" is a phonetic type of error.

In any case it is known in the art to provide a correction pass
which performs a phonetic type of correction and another correction
pass which performs a typographic type of correction. These two
passes provide two lists of candidate words, specifically a list of
good phonetic matches and a list of good typographic matches.

Ranking the words in these two lists and merging them into a single
list provides an enhanced result for a user in that on the average
it materially decreases the time it takes the user to find the
desired correctly spelled word.

Accordingly, the purpose of this invention is to provide a hand held
electronic spelling device which provides a candidate list for a
query word, which list merges the typographic correction list and
the phonetic correction list and which orders the words in the
merged list in an enhanced fashion in terms of value to the user.

A preferred embodiment of the invention employs the enhanced
technique for encoding strings as disclosed in the above referenced
U.S. Pat. No. 5,113,340.

U.S. Pat. No. 4,490,811 dated Dec. 25, 1984 and entitled String
Comparator Device System Circuit and Method discloses a string
comparator technique of the sort with which the improvement of this
invention operates. Applicant's references herein to a word
similarity characteristic is to the string comparison function theta
disclosed in the '811 patent. For convenience this function is
called the theta function herein.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 is a plan view of a device that incorporates the improvement
of this invention.

FIG. 2 is a block and flow diagram indicating the sequence and
relationship of the steps involved in merging the phonetic candidate
list of words with the typographic candidate list of words.

FIG. 3 is a chart identifying the theta values designated in FIG. 2
and indicating the class boundaries that determine into which class
a particular candidate word is placed. FIG. 3 is to be viewed in
connection with FIG. 2.

DESCRIPTION OF THE PREFERRED
EMBODIMENTS

FIG. 1 is similar to FIG. 4 of the parent application. The preferred
technique of selecting candidate words from memory based on a query
word is the technique disclosed in the above-referenced U.S. Pat.
No. [...] device which embodies the present invention is a
self-contained, battery operated, readily portable, hand holdable
device 10 having a one line LCD character display 11 and a keyboard
12. The keyboard 12 includes keys for the twenty-six letters of the
alphabet. In addition, it has an on switch 13 and a back space key
14. The clear key 15 clears the display and permits the user to
initiate query words. The two keys 16 are scroll keys permitting the
user to scroll back and forth through the word list that has been
developed. The enter key 17 is actuated after the query word has
been completed so the device will perform the search function. This
involves a tree traverse to provide a phonetic candidate list ranked
in accordance with a phonetic similarity characteristic (a phonetic
theta value) and to provide a typographic candidate list ranked in
accordance with a typographic similarity function (an ASCII theta
value).

FIG. 2 is a chart designating the processing flow of 
the word strings being compared and classified and 
ranked. 

A query word is input at step 20. The query word is 
then compared against the set of words in memory by a 
first known type of processing step to provide a list of 
candidates based on a phonetic comparison of the query 
word with the words in memory. This produces the 
phonetic list of candidate words 22. 

A list of typographical candidates is also provided, as
indicated at stage 24. This list is generated by means of
a known type of typographic comparison between the 
query word and the set of words in memory. 

In this fashion, both a typographic candidate list and
a phonetic candidate list are prepared. These two lists
may be generated by any one of a number of different
known techniques. Normally the number of words in
each list is limited to a predetermined number, namely
those with the highest similarity function values.
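
The limiting step just described can be illustrated with a minimal
Python sketch. The theta function of the '811 patent is not
reproduced in this text, so the function theta below is only a
hypothetical stand-in built on the standard library's
difflib.SequenceMatcher and scaled to the 0 to 126 range used in the
embodiment. The name top_candidates and the default limit of seven
(the figure given for one embodiment) are likewise illustrative
assumptions.

```python
from difflib import SequenceMatcher

THETA_MAX = 126  # highest theta value in the disclosed embodiment


def theta(a: str, b: str) -> int:
    """Hypothetical stand-in for the '811 theta function (0..126).

    This is NOT the patented comparator; it only gives a similarity
    score on the same scale so the list handling can be demonstrated.
    """
    return round(THETA_MAX * SequenceMatcher(None, a, b).ratio())


def top_candidates(query: str, words: list[str], limit: int = 7):
    """Return up to `limit` (theta, word) pairs, highest theta first."""
    scored = [(theta(query, w), w) for w in words]
    # Sort by descending theta; break ties alphabetically for stability.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return scored[:limit]


if __name__ == "__main__":
    print(top_candidates("bell", ["zebra", "ball", "bell", "bel"], limit=2))
```

In an actual device the same truncation would be applied separately
to the phonetic list 22 and the typographic list 24, each with its
own similarity function.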

The similarity function values are calculated for each candidate
word in each of the two lists 22, 24. These similarity values are
calculated by a technique disclosed in U.S. Pat. No. 4,490,811 and,
in a preferred embodiment, incorporate the techniques further
disclosed in U.S. Pat. No. 5,113,340. The similarity function is the
theta value mentioned in the '811 patent. It will be convenient
herein to call the similarity value a theta value.

5,218,536

As indicated in the stage 26, there are four separate theta values
calculated for each word on the phonetic candidate list and, as
indicated at stage 28, there are three separate theta values
indicated for each word in the typographic candidate list.

A phonetic theta, designated P1 or P2, is a similarity evaluation of
the fully encoded phonetic strings. That is, the fully encoded query
string and the fully encoded candidate string are compared to
provide the theta value P1 of a candidate from the phonetic list and
the theta value P2 of a candidate from the typographic list. In
principle, there is no difference between P1 and P2 except that the
two designate which list the theta value derives from.

An ASCII theta, designated A1 or A2, is a theta evaluation of the
ASCII character strings. This is a comparison of the ASCII string of
the query word and the ASCII character string of the candidate word.
The ASCII string is the string of letters that make up the word.

Again, there is no difference in the technique of calculating A1 and
A2 theta values except that A1 designates those values for candidate
words on the phonetic list and A2 designates those values for
candidate words on the typographic list 24.

A vowel and consonant pattern theta, designated V1 or V2, is a theta
evaluation of the vowel and consonant patterns of the ASCII strings.
In this theta, only the pattern of the vowels and consonants in the
words being compared is important. Thus the words are converted to
strings representing only vowel and consonant pattern information
before the similarity comparison is made. All the vowels are used as
is and all the consonants in the word are converted to the letter
"M". For example, the V1 or V2 theta for the words "hello" and
"hallow" is determined by converting these words to memmo and
mammom. It is the ASCII strings for these two words which are
compared to provide the value of V1 or V2.

Finally, there is an essence theta E1 which is calculated only for
candidate words on the phonetic list 22. This essence theta E1 is
the similarity evaluation of the essence only phonetic string. That
is, all of the vowels are deleted from the phonetic strings of the
two words being compared. The two words being compared are the query
word and the candidate word from the phonetic list 22.

These theta values, indicated at 26 and 28, are employed as the main
factor in creating the merged list. Each candidate word from the
phonetic list 22 and the typographic list 24 is placed in one of
seven classes, class zero through class VI, depending upon the theta
values involved as well as three other particular characteristics of
certain words as set forth at blocks 29, 30 and 31.

More particularly, as indicated at block 32, the candidate words
from the lists 22 and 24 are merged into classes as a function of
the theta values set forth at 26 and 28 as well as a function of
certain special characteristics set forth at 29, 30 and 31.

Class Zero

Class zero indicates the most likely candidate and a word from the
lists 22 and 24 is placed in class zero if it has the highest
possible theta value, which in the embodiment disclosed herein is
the value of 126. Thus if the phonetic theta P1 and the ASCII theta
A1 for a word in the phonetic list are both 126, then that candidate
word is placed in class zero. Similarly if the ASCII theta A2 and
the phonetic theta P2 for a candidate word on the typographic list
are also both 126, then that word is placed in class zero.

Class I

Class I represents a situation where there is a very high degree of
matching.

For a word on the phonetic list 22, if its phonetic theta P1 is 126
and its essence theta E1 is 126, it qualifies for class I.

A word on the typographic list 24 becomes a member of class I, as
indicated at box 30, if the only difference between the candidate
word and the query word is a vowel transposition. For example, if
the query word is "neice", the typographic list 24 will show as one
of the candidate words "niece". Since the latter word only involves
a vowel transposition of the query word, that word is placed into
class I.

Class II

Class II is a group of candidate words that are the next most likely
set of words to be sought by the user as the word corresponding to
the input query word. In class II, the criteria for the words on the
phonetic list 22 are quite different from the criteria for the words
on the typographic list.

As indicated at stage 30, a word from the typographic list 24 is
deemed to be in class II if the list word involves either a vowel
substitution as compared with the query word or a vowel addition or
deletion adjacent to a vowel in the query word. For example, if the
query word is "ball", the typographic list 24 would include "bell".
Since this involves a vowel substitution for the query word, the
list word "bell" would be designated as being class II. Similarly,
if the query word were "nice", one of the typographic list 24 words
would be "niece". That involves the addition of a vowel adjacent to
another vowel and thus qualifies "niece" as a class II word.
Similarly, if the query word were "niece", then the typographic list
word "nice" would qualify as a class II member because it involves
only the deletion of a vowel adjacent to another vowel.

Candidate words from the phonetic list 22 qualify for class II if
the phonetic theta P1 is equal to or greater than 120 and also if
the essence theta E1 is equal to or greater than 120. Of course,
these criteria only apply to words which have not already been
classified into higher classes. However, for this phonetic list 22
word to qualify in class II, a third requirement set forth in stage
29 must also be met.

This third requirement which must be met in order to be in class II
is that the word (on the phonetic list) must have its first and last
phonetic code identical with the first and last phonetic code of the
query word. That is the criterion set forth at stage 29 and is one
of the characteristics that has to be processed by the merge
algorithm of stage 32 in order for a list 22 candidate word to
satisfy a class II set membership.

Classes III-VI

The remaining classes involve less likely candidate words and
involve a somewhat simpler classification algorithm. Thus, those
words on the lists 22 and 24 which have not been classified in the
higher classes zero, I and II, would be classified in class III if
either the
phonetic theta P1 of a word on the list 22 or the ASCII theta A2 of
a word on the typographic list 24 has a value equal to or greater
than 96.

Similarly, for the remaining words on the two lists 22, 24, if
either P1 or A2 is equal to or greater than 64, the candidate word
would qualify to be in class IV. And for the remaining words, if P1
or A2 is equal to or greater than 32, the candidate word would
qualify to be in class V. Finally all the remaining candidate words,
if any, would qualify for class VI.

In addition to the class designation of each word on the lists 22,
24, which is effected at stage 32, each word on the lists 22, 24 is
given a rank value. As indicated at stage 33, the rank value for
each word on the phonetic list 22 is one-fourth of the sum of the
four theta values P1, A1, V1 and E1 which are calculated for the
word. The rank value for each word on the typographic list 24 is, as
indicated at stage 34, one-third of the sum of the three theta
values P2, A2 and V2 which are calculated for each word on the list
24.

These rank values determine the sequence of the listing of the word
within each class but do not affect class membership. All words in
class II are listed in order of rank value with the higher rank
words being listed first. Because class membership takes priority
over rank value, a word in class II with a rank value greater than a
word in class I would appear later in the merged list than would the
word in class I. This final arrangement of candidate words as a
merged list in class order first and in rank order within each class
is the output, indicated at stage 35, which is made available to the
user by being called up on the screen and scrolled through from most
likely word to least likely word.

The phonetic correction candidate list 22 tends to be a more
significant or important list than is the typographic correction
candidate list 24. Thus it would not be desirable to simply
interleave alternate members of the two lists 22 and 24 in terms of
their rank order on their respective lists. However, there are
frequently cases where the typographic candidate is highly likely to
be the sought for word because of the nature of the typographic
error. By and large, such are cases where there are vowel
substitutions or vowel inversions or vowel addition or vowel
deletion next to another vowel.

What the above described technique does by creating certain classes
which have priority over the similarity function rank calculations
is to make it possible to strongly weight the final arrangement in
favor of the better phonetic list 22 candidates and yet incorporate
in the uppermost members of the merged list the most likely ones of
the typographic list candidates.

Thus, a typographic list 24 candidate which has a maximum phonetic
similarity function (theta = 126) and a maximum ASCII similarity
value (theta = 126) will be placed in class Zero. In addition, the
vowel substitution and correction arrangements set forth at stages
30 and 31 will cause a typographic candidate from list 24 to be
placed in the uppermost classes I or II.

The arrangement then in classes III through VI is substantially a
function of the phonetic similarity function of phonetic candidates
and the ASCII similarity function of typographic candidates.

The key merging stage is indicated at block 32. This involves the
placing of each candidate word into a class. The class is determined
as a function of the special characteristics set forth in decision
steps 29, 30 and 31. That is, for all candidate words which meet the
criteria of decision steps 29, 30 and 31, a determination of class I
or class II is decided. For all other candidate words, which would
be the vast majority of candidate words, the class designation for
the word is determined by the theta values for the word in
accordance with the criteria indicated below. It should be kept in
mind that class designations are mutually exclusive and that a word
is placed in the highest class for which it is eligible. For
example, a candidate word which qualifies for class II is placed in
class II even though it would qualify for the criteria set forth in
classes III through VI below. Thus, the criteria set forth in the
chart below are the criteria for the floor of the class involved.
The ceiling of the class involved is the floor of the class
immediately above.

In a preferred embodiment, although not specifically indicated in
the FIGs, if the same candidate word is on the phonetic list and the
typographic list, it is only incorporated in the merged list once
and it is incorporated in a position determined by whatever is the
higher class for which it qualifies.

It should be recognized that one of the similarity characteristics
(theta value) is calculated for each word during the search routine
to determine if the word should be put into the candidate list 22 or
24. It is the phonetic similarity function that is calculated to
determine if a word is to be placed on the phonetic list 22 and the
ASCII similarity function which is calculated to determine if a word
is to be put on the typographic list 24. In one embodiment each of
those lists is limited to seven candidates so that a total of, at
the most, fourteen words have to be considered for this merge
routine. In such a case, the sensitivity of the routine for those
words at the bottom of the list is less significant than is the
sensitivity of the routine for the words at the top of the list.
That is, it is important for the user to very often find the word
being sought as one of the first two or three words on the list.
This is why the routine for classification in classes I and II is
relatively complex.

The maximum value for the similarity characteristic theta can be
programmed to be any desired value and the threshold values which
qualify a candidate word for a particular class can be set as
desired. To some extent, the relative threshold values will be a
function of the maximum number of words that can be on the candidate
lists 22 and 24. It is important from the point of view of user
convenience to have appreciable discrimination in connection with
class Zero, I and II. The threshold values for the similarity
characteristic theta described herein provide one example of
relative values which have been found useful in a practical
embodiment.

The ranking within the classes is of importance because there are
situations where, depending on the query word, a substantial number
of candidate words appear in a given class. Experience shows that
there is a better ranking within that class if the theta values
other than the primary theta value P1 or A2 are given some weight.

This invention has been described in the context of certain types of
strings and in particular phonetic strings and ASCII strings. It
should be understood that the ASCII string refers to one particular
typographic string and that this invention can be used with any type
of typographic string including, for example, the EBCDIC string
which is the IBM standard. It should also be understood that the
phonetic string employed may be any one of the phonetic strings
available to those in this art which have been used to provide
phonetic encoding. Since the invention is not limited to any
particular phonetic string encoding or alpha-numeric character
string encoding, such are not described in any greater detail
herein.
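
The string conversions behind the vowel and consonant pattern theta
and the essence theta, and the vowel-edit tests of stages 30 and 31,
can be illustrated with a short Python sketch. It is a sketch under
stated assumptions only: the vowel set is taken to be a, e, i, o, u
(the description does not say how "y" is treated), a transposition
is taken to mean a swap of two adjacent vowels, and essence is shown
on plain letters because the phonetic encoding itself is not given
here. All function names are hypothetical.

```python
VOWELS = set("aeiou")  # assumption: "y" is treated as a consonant


def vc_pattern(word: str) -> str:
    """Keep vowels as is and replace every consonant with 'm'."""
    return "".join(ch if ch in VOWELS else "m" for ch in word.lower())


def essence(phonetic: str) -> str:
    """Delete all vowels from an already phonetically encoded string."""
    return "".join(ch for ch in phonetic.lower() if ch not in VOWELS)


def _diff_positions(q: str, c: str) -> list[int]:
    return [i for i in range(len(q)) if q[i] != c[i]]


def is_vowel_transposition(q: str, c: str) -> bool:
    """True if c is q with two adjacent vowels swapped (class I test)."""
    if len(q) != len(c):
        return False
    d = _diff_positions(q, c)
    return (len(d) == 2 and d[1] == d[0] + 1
            and q[d[0]] == c[d[1]] and q[d[1]] == c[d[0]]
            and q[d[0]] in VOWELS and q[d[1]] in VOWELS)


def is_vowel_substitution(q: str, c: str) -> bool:
    """True if c differs from q by one vowel replaced by another vowel."""
    if len(q) != len(c):
        return False
    d = _diff_positions(q, c)
    return len(d) == 1 and q[d[0]] in VOWELS and c[d[0]] in VOWELS


def is_vowel_indel_next_to_vowel(q: str, c: str) -> bool:
    """True if c is q with one vowel added or deleted beside a vowel."""
    longer, shorter = (c, q) if len(c) > len(q) else (q, c)
    if len(longer) != len(shorter) + 1:
        return False
    for i, ch in enumerate(longer):
        if ch in VOWELS and longer[:i] + longer[i + 1:] == shorter:
            left = i > 0 and longer[i - 1] in VOWELS
            right = i + 1 < len(longer) and longer[i + 1] in VOWELS
            if left or right:
                return True
    return False


if __name__ == "__main__":
    print(vc_pattern("hello"), vc_pattern("hallow"))
    print(is_vowel_transposition("neice", "niece"))
```

The pattern strings produced by vc_pattern would then be compared by
whatever theta function the device uses to give V1 or V2. The
results of the three vowel-edit tests are yes or no characteristics
and do not themselves produce a theta value.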

Attached hereto as Appendix A is a presently preferred
listing in C Source Code. This listing is by way of
an example of the routines for implementing the merge
functions of this invention. A skilled programmer may
implement the invention by means of a different code
listing.
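
Independently of Appendix A, which is not reproduced in this text,
the class assignment and rank ordering described for stages 32
through 35 can be sketched in Python as follows. The thresholds 126,
120, 96, 64 and 32 are the ones given in the description. The
Candidate record, its field names, and the use of rank value as a
tie-breaker when the same word appears on both lists with the same
class are illustrative assumptions, not the appendix code.

```python
from dataclasses import dataclass

THETA_MAX = 126
CLASS_II_FLOOR = 120
LOWER_FLOORS = [(96, 3), (64, 4), (32, 5)]  # (floor, class III..V)


@dataclass
class Candidate:
    word: str
    source: str  # "phonetic" (list 22) or "typographic" (list 24)
    p: int       # phonetic theta P1 or P2
    a: int       # ASCII theta A1 or A2
    v: int       # vowel and consonant pattern theta V1 or V2
    e: int = 0   # essence theta E1 (phonetic list only)
    ends_match: bool = False           # stage 29 characteristic
    vowel_transposition: bool = False  # class I typographic test
    vowel_sub_or_indel: bool = False   # class II typographic test


def assign_class(c: Candidate) -> int:
    """Return 0..6 for class zero through class VI."""
    if c.p == THETA_MAX and c.a == THETA_MAX:
        return 0
    if c.source == "phonetic":
        if c.p == THETA_MAX and c.e == THETA_MAX:
            return 1
        if c.p >= CLASS_II_FLOOR and c.e >= CLASS_II_FLOOR and c.ends_match:
            return 2
        primary = c.p  # P1 decides classes III..VI for list 22
    else:
        if c.vowel_transposition:
            return 1
        if c.vowel_sub_or_indel:
            return 2
        primary = c.a  # A2 decides classes III..VI for list 24
    for floor, cls in LOWER_FLOORS:
        if primary >= floor:
            return cls
    return 6


def rank_value(c: Candidate) -> float:
    """One-fourth of P1+A1+V1+E1, or one-third of P2+A2+V2."""
    if c.source == "phonetic":
        return (c.p + c.a + c.v + c.e) / 4
    return (c.p + c.a + c.v) / 3


def merge(cands: list[Candidate]) -> list[str]:
    """Order words by class first, then by descending rank value."""
    best = {}
    for c in cands:
        key = (assign_class(c), -rank_value(c))
        # A word on both lists is kept once, in its better position.
        if c.word not in best or key < best[c.word]:
            best[c.word] = key
    return sorted(best, key=lambda w: best[w])


if __name__ == "__main__":
    demo = [
        Candidate("niece", "typographic", p=110, a=100, v=126,
                  vowel_transposition=True),
        Candidate("nice", "phonetic", p=100, a=90, v=80, e=100),
        Candidate("nick", "typographic", p=60, a=70, v=90),
    ]
    print(merge(demo))
```

Because the sort key puts the class number first, a class II word
with a high rank value still follows every class I word, which is
the priority of class membership over rank value described above.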

What is claimed is: 

1. In an electronic machine spelling corrector to provide
an output list of candidate words in response to an
input query word, the method of merging a list of phonetic
correction candidate words with a list of typographic
correction candidate words to provide a single
merged output list for presentation to a user, comprising:

for each candidate word on the phonetic list, providing
a first set of similarity values including a first
value P1 based on a phonetic comparison of the
candidate word with the query word and a second
value A1 based on a typographic comparison of the
candidate word with the query word,

for each candidate word on the typographic list,
providing a second set of similarity values including
a third value P2 based on a phonetic comparison
of the candidate word with the query word and
a fourth value A2 based on a typographic comparison
of the candidate word with the query word,

merging candidate words on the phonetic list and 
typographic list into the output list, and 

ordering said candidate words on said merged list as
a partial function of said four similarity values.

2. The method of claim 1 including the steps of:
calculating an average of said similarity values for
each of said candidate words from said phonetic
and typographic lists, and wherein:
said step of ordering includes using said averages as a
partial basis for determining the position of the
candidate word on the merged list.

3. The method of claim 1 further comprising the steps 
of: 

for each candidate word on the phonetic list, including
in said first set of similarity values, a fifth similarity
value E1 based on an essence comparison of
the consonants of the candidate word with the
consonants of the query word,

assigning to a first priority class, (a) candidate words
from said phonetic list wherein said first similarity
value P1 and said second similarity value A1 indicate
a perfect match with the query word and (b)
candidate words from said typographic list
wherein said third similarity value P2 and said
fourth similarity value A2 indicate a perfect match
with the query word,

assigning to a second priority class, (a) candidate
words from said phonetic list wherein said first
similarity value P1 and said fifth similarity value E1
indicate a perfect match with the query word and
(b) candidate words from said typographic list
which have a vowel transposition compared to the
query word,

assigning to a third priority class, (a) candidate words
from said phonetic list wherein (i) said first value
P1 exceeds a first predetermined threshold and (ii)
said fifth value E1 exceeds a second predetermined
threshold and (iii) the first and last phonetic codes
of the candidate words and the query words are the
same, and (b) candidate words from said typographic
list, have either (i) a vowel substitution
compared to the query word or (ii) have a vowel
addition adjacent to another vowel in the query
word or (iii) a vowel deletion adjacent to another
vowel in the query word, and
assigning to one or more lesser priority classes, those
words wherein (a) said first similarity value P1
exceeds a predetermined threshold or (b) said
fourth similarity value A2 exceeds a predetermined
threshold.

4. The method of claim 3 including the steps of: 
calculating an average of said similarity values for 

each of said candidate words from said phonetic 
and typographic lists, and 
ordering said candidate words on said merged list 
within each of said priority classes as a function of 
said averages. 

5. The method of claim 3 further comprising the steps 
of: 

for each candidate word on said phonetic list, including
in said first set of similarity values, a sixth similarity
value V1 based on a vowel and consonant
pattern comparison of the candidate word with the
query word,

for each candidate word on said typographic list,
including in said second set of similarity values, a
seventh similarity value V2 based on a vowel and
consonant pattern comparison of the candidate
word with the query word,

calculating an average of said similarity values for 
each of said candidate words from each of said lists, 
and 

ordering said candidate words within each of said 
priority classes as a function of said averages. 

6. In an electronic machine spelling corrector which 
provides an output list of candidate words in response 
to an input query word, the output list being the merger 
of a list of phonetic correction candidate words and a 
list of typographic correction candidate words compris- 
ing: 

a first class of words comprising (a) candidate words
from said phonetic list wherein a phonetic comparison
of the candidate word with the query word
and a typographic comparison of the candidate
word with the query word indicates a perfect
match, and (b) candidate words from said typographic
list wherein a phonetic comparison of the
candidate word with the query word and a typographic
comparison of the candidate word with the
query word indicates a perfect match,

a second class of words excluding words in said first
class comprising (a) candidate words from said
phonetic list wherein a phonetic comparison of the
candidate word with the query word and an essence
comparison of the candidate word with the
query word indicates a perfect match, and (b) candidate
words from said typographic list which
have a vowel transposition compared to the query
word,

a third class of words excluding words in said first
and second classes and comprising (a) words from
said phonetic list wherein (i) a phonetic comparison
of the candidate word with the query word and
an essence comparison of the candidate word with
the query word exceed a pre-determined threshold
and (ii) the first and last phonetic codes of the
candidate word and the query word are the same,
and (b) candidate words from said typographic list
which have a vowel substitution compared to the
query word or which have a vowel addition or a
vowel deletion adjacent to another vowel in the
query word, and
at least a fourth class of words excluding words from
said first, second and third classes and comprising
those candidate words from said phonetic list
whose phonetic comparison with the query word
exceeds a predetermined threshold and those candidate
words from said typographic list whose
typographic comparison with the query word exceeds
a predetermined threshold.

7. The output list of claim 6 wherein: 

position within each of said classes, is determined by 
an average of comparison values calculated from 
each of said phonetic, typographic and essence 
comparisons. 

8. The output list of claim 7 wherein:

said average of comparison values includes a comparison
value based on a vowel and consonant pattern
comparison of the candidate word with the query
word.

9. In an electronic machine spelling corrector, the
output list of candidate words in response to an input
query word, the output list being the merger of a list of
phonetic correction candidate words and a list of typographic
correction candidate words, comprising:

a first set of candidate words comprising candidate
words from said phonetic list and said typographic
list where the phonetic code and typographic code
for the candidate word are substantially identical to
the phonetic code and typographic code for the
query word,
a second set of candidate words comprising:
(a) candidate words from the phonetic list where
the phonetic code and the essence code for the
candidate words are substantially identical to the
phonetic code and the essence code for the query
word, and
(b) words from said typographical list where the
only difference between the candidate word and
the query word is a vowel transposition,
said second set excluding all words in said first set, 
a third set of candidate words comprising: 

(a) candidate words from the phonetic list which 
meet the three criteria that (i) a comparison of 
the phonetic codes of the candidate word and 
query word exceed a first predetermined value, 
(ii) a comparison of the essence codes of the 
candidate word and the query word exceed a 
second predetermined value, and (iii) the first 
and last phonetic codes of the candidate word 
and query word are identical, and 

(b) candidate words from said typographic list
which have either (i) a vowel substitution as
compared with the query word or (ii) a vowel
addition or deletion adjacent to a vowel in the
query word,

said third set excluding all words in said first and second
set, and
a fourth set of candidate words comprising: 

(a) words from said phonetic list whose phonetic 
code comparison with the phonetic code of the 
query word provides a comparison value that 
exceeds a predetermined value, and 

(b) words from said typographic code whose typo- 
graphic code comparison with the typographic 
code of the query word exceeds a second prede- 
termined value, 

said fourth set excluding all words in said first, second 
or third sets. 

10. The output list of claim 9 wherein: 

position within each of said sets is determined by an 
average of comparison values calculated from each 
of said phonetic, typographic and essence compari- 
sons. 

11. The output list of claim 10 wherein: 

said average of comparison values includes a compar- 
ison value based on a vowel and consonant pattern 
comparison of the candidate word with the query 
word. 